Learning Correction Grammars

Authors

  • Lorenzo Carlucci
  • John Case
  • Sanjay Jain
Abstract

We investigate a new paradigm in the context of learning in the limit, namely, learning correction grammars for classes of computably enumerable (c.e.) languages. Knowing a language may feature a representation of it in terms of two grammars. The second grammar is used to make corrections to the first grammar. Such a pair of grammars can be seen as a single description of (or grammar for) the language. We call such grammars correction grammars. Correction grammars capture the observable fact that people do correct their linguistic utterances during their usual linguistic activities. We show that learning correction grammars for classes of c.e. languages in the TxtEx-model (i.e., converging to a single correct correction grammar in the limit) is sometimes more powerful than learning ordinary grammars even in the TxtBc-model (where the learner is allowed to converge to infinitely many syntactically distinct but correct conjectures in the limit). For each n ≥ 0, there is a similar learning advantage, again in learning correction grammars for classes of c.e. languages, but where we compare learning correction grammars that make n + 1 corrections to those that make n corrections. The concept of a correction grammar can be extended into the constructive transfinite, using the idea of counting down from notations for transfinite constructive ordinals. This transfinite extension can also be conceptualized as being about learning Ershov-descriptions for c.e. languages. For u a notation in Kleene’s general system (O, <_o) of ordinal notations for constructive ordinals, we introduce the concept of a u-correction grammar, where u is used to bound the number of corrections that the grammar is allowed to make. We prove a general hierarchy result: if u and v are notations for constructive ordinals such that u <_o v, then there are classes of c.e. languages that can be TxtEx-learned by conjecturing v-correction grammars but not by conjecturing u-correction grammars.
Lorenzo Carlucci and John Case were supported in part by NSF Grant number CCR-0208616 at the University of Delaware. Lorenzo Carlucci was also supported in part by grant number 13393 of the John Templeton Foundation, and by a Telecom Italia “Progetto Italia” Fellowship at the Scuola Normale Superiore di Pisa. Sanjay Jain was supported in part by NUS grant numbers R252-000-308-112 and R252-000-212-112.
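
As a rough illustration of the idea in the abstract, the sketch below treats a correction grammar as a pair of enumerators: the first lists candidate members, the second lists retractions, and the described language is what the first enumerates minus what the second enumerates. Reading "corrections" as set difference is an assumption drawn from the abstract's wording, and the two enumerators and all function names are hypothetical examples, not taken from the paper.

```python
from itertools import islice

def enumerate_base():
    """Hypothetical first grammar: enumerates the multiples of 3."""
    n = 0
    while True:
        yield n
        n += 3

def enumerate_corrections():
    """Hypothetical second grammar: enumerates the even numbers (retractions)."""
    n = 0
    while True:
        yield n
        n += 2

def approx_member(x, stage):
    """Stage-bounded guess: is x enumerated by the first grammar and not yet
    retracted by the second, looking only at the first `stage` outputs of each?"""
    in_base = x in set(islice(enumerate_base(), stage))
    in_corr = x in set(islice(enumerate_corrections(), stage))
    return in_base and not in_corr

def guesses(x, max_stage):
    """Sequence of guesses about x at stages 0..max_stage."""
    return [approx_member(x, s) for s in range(max_stage + 1)]

def mind_changes(seq):
    """Number of times consecutive guesses differ."""
    return sum(1 for a, b in zip(seq, seq[1:]) if a != b)

if __name__ == "__main__":
    for x in (3, 4, 6):
        g = guesses(x, 5)
        print(x, g, "mind changes:", mind_changes(g))
```

In this toy setting the guess about 6 goes from "out" to "in" (once the first enumerator lists it) and back to "out" (once the second retracts it), while 3 is accepted and never retracted. With a single correcting grammar, each element's status can change at most twice, which gives an informal picture of what bounding the number of corrections means.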


Similar resources

Learning unification-based natural language grammars

Practical text processing systems need wide covering grammars. When parsing unrestricted language, such grammars often fail to generate all of the sentences that humans would judge to be grammatical. This problem undermines successful parsing of the text and is known as undergeneration. There are two main ways of dealing with undergeneration: either by sentence correction, or by grammar correct...


Acquiring Plausible Unification-Based Grammars using Model-Based and Data-Driven Learning

Undergeneration is a problem that undermines successful parsing of unrestricted texts. A popular solution to this problem is automatic grammar correction (or machine learning of grammar). Broadly speaking, grammar correction approaches can be classified as being either data-driven, or model-based. Data-driven learners use data-intensive methods to acquire grammar. They typically use grammar form...


Correction Grammars for Error Handling in a Speech Dialog System

Speech recognition errors are inevitable in a speech dialog system. This paper presents an error handling method based on correction grammars which recognize the correction utterances which follow a recognition error. Correction grammars are dynamically created from existing grammars and a set of correction templates. We also describe a prototype dialog system which incorporates this error hand...


Concept Revision of Age, Motivation, and Error Correction in Second Language Learning

The current review article investigates some variables contributing to English language teaching and learning. Three factors, age, motivation, and error correction, have been of importance in English language curricula in language centres. Some studies have been conducted to investigate various effects of these three components on English language acquisition; those studies, however, may lack d...


Two Experiments on Learning Probabilistic Dependency Grammars from Corpora

We present a scheme for learning probabilistic dependency grammars from positive training examples plus constraints on rules. In particular we present the results of two experiments. The first, in which the constraints were minimal, was unsuccessful. The second, with significant constraints, was successful within the bounds of the task we had set. We will explicate dependency gramm...



Journal:
  • J. Symb. Log.

Volume 74

Publication date: 2007